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Abstract 

We introduce a permutation analogue of the celebrated Szemeredi 
Regularity Lemma, and derive a number of consequences. This tool 
allows us to provide a structural description of permutations which 
avoid a specified pattern, a result that permutations which scatter 
small intervals contain all possible patterns of a given size, a proof that 
every permutation avoiding a specified pattern has a nearly monotone 
linear-sized subset, and a "thin deletion" result. We also show how one 
can count sub-patterns of a permutation with an integral, and relate 
our results to permutation quasirandomness in a manner analogous to 
the graph-theoretic setting. 

1 Introduction 

The Szemeredi Regularity Lemma, a tool developed in the early 1970's in 
service of the combinatorial milestone now known as the Szemeredi Theo- 
rem, has turned out to be one of the most useful tools in graph theory ever 
discovered. In essence, it says that any graph can be approximated by a small 
collection of random-like graphs. This powerful structural characterization 
allows one to answer questions about graphs by taking such a "Szemeredi 
partition" and then addressing the question by using known facts about ran- 
dom graphs. A number of variants of the Regularity Lemma (or Uniformity 
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Lemma, as it is sometimes called) have emerged since the publication of the 
original. Versions of it giving structural decompositions of hypergraphs have 
been used in many contexts, and a few results have addressed the difficult 
case of sparse graph regularity. The reader is encouraged to read the excel- 
lent surveys of Komlos and Simonovits [Hj and Kohayakawa and Rodl [H] to 
learn about how and where the Lemma is used, how it is proved, and what 
its limitations are. 

An idea intimately related to regularity - quasirandomness - was intro- 
duced by Chung, Graham, and Wilson in [2]. They show that a surprisingly 
large collection of random-like properties of graphs are in fact equivalent. 
Then, in a series of remarkable papers, Chung and Graham applied similar 
analyses to hypergraphs, subsets of Z n , tournaments, and other combina- 
torial objects. The following decade witnessed a flurry of generalizations 
and elaborations appearing in the literature, with much of the work explor- 
ing the connections between regularity and quasirandomness. In particular, 
Simonovits and Sos [TT] showed how quasirandomness is equivalent to the 
property of having a Szemeredi partition into pieces whose regular pairs have 
density 1/2. 

The author defined quasirandom permutations in |3] and proved that sev- 
eral classes of simple arithmetic functions almost always give rise to quasi- 
random permutations ((!]). The central paradigm is the same: a large collec- 
tion of natural, random-like properties are mutually equivalent. However, the 
connections with regularity break down in this realm, as it has not been pos- 
sible so far to bridge the worlds of graph quasirandomness and permutation 
quasirandomness. 

In the present paper, we remedy this situation by proving a regularity 
lemma for permutations and analogizing the basic results used alongside the 
graph Regularity Lemma. The main result (Theorem |2J) says that the ground 
set of any permutation may be decomposed into a small exceptional set and 
a bounded number of intervals in the remaining points so that the action of 
the permutation is randomlike on each such interval. Our hope is that this 
tool will help address the nascent realm of "extremal permutation" problems 
and lead to other work analogous to that of Extremal Graph Theory. 

Examples of extremal permutation problems include: 

1. For any permutation r, give a structural description of the permutation 
that avoids r, i.e., cr|j is not order-isomorphic to r for any index set /. 
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The problem of showing that the number of such a is at most exponen- 
tial in the number of symbols is commonly known as the "Stanley- Wilf 
Conjecture", and was recently solved by Marcus and Tardos |lUj . 

2. For a given permutation r, which permutation a has the maximum 
number of "copies" of r, in the above sense? We write the number of 
such copies as A T (<r). This question has seen a number of advances 
in the past ten years, following Herb Wilf's address at the 1992 SIAM 
Conference on Discrete Math. One particularly nice addition to the 
recent literature in this realm is jjj. 

3. Given permutations r and r', what is the expected value of A T ' (a) in the 
space of permutations a chosen uniformly among those permutations 
on n symbols which avoid r? What is the maximum value of A r (er) 
among all those permutations a which avoid r? 

4. Call a sequence of permutations {fli}^, cr, a permutation of symbols 
with rii — > oo, asymptotically k-symmetric if, for each r, a permutation 
on k symbols, A r (<7j) = (^)(1 + o(l))/k\. Does there exist, for all k, a 
sequence which is asymptotically /c-symmetric but not asymptotically 
(k + l)-symmetric? This question of R. L. Graham appears in [3] and 
is open except for k = 1, 2, 3. 

The rest of the paper is as follows. In the next section, we define regularity 
and uniformity for permutations and prove the existence of a regular /uniform 
partition. Then, in Section El we address Problem 1 above with structural 
results about permutations which avoid a given pattern. These results are 
used in Section 0] to show that only a small number of pairs of points need 
be deleted to destroy all copies of a pattern in a permutation which has few 
of them to begin with. Sectional provides a connection between permutation 
quasirandomness and regularity, and a proof of a new characterization of 
permutation quasirandomness. The following section contains a discussion of 
the (asymptotic) pattern counts one can compute given a regular partition of 
any permutation, and the final section contains a full proof of the permutation 
regularity lemma. 
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2 Regularity 



We provide two versions of a permutation regularity result, the latter of 
which appears to be the more interesting and applicable, and we distinguish 
the two settings through the use of the terms "regular" and "uniform." The 
first result, concerning regularity, we state below but relegate the proof - 
which is quite standard - to Section [7| 

We consider permutations to be elements of & n , the set of bijective maps 
from Z n to itself. For a permutation a G & n and subsets S,T C Z n , we 
write p(S,T) = \{(s,t) G S x T : a(s) < t}\, and d(S,T) = p(S,T)/\S\\T\. 
Throughout the rest of this paper, we consider only partitions in which each 
Cj, i > 1, is an interval. Though it is something of an abuse, we will often 
speak of a "partition of a" instead of a partition of Z n . For integers s and t 
and an e > 0, say that the pair (C s , C t ) is e-regular if, for all intervals I C C s 
and J C Ct with |/| > e|C s | and \ J\ > e\C t \, we have 

\d(I,J) -d(C s ,C t )\ < e. 

Then we call V an e-regular partition into k parts if \C S \ = \C t \ for all 
1 < s,t < k, | Co I < an d (Cs, Ct) is an e-regular pair for all but ek 2 pairs 
(s,t). (If P has only this first property, it is called equitable.) 
Our first theorem is the following. 

Theorem 1 (Permutation Regularity). Given m G N + and e > 0, there 
exist M = M(e, m) and N = N(e, m) so that any a : Z n — ► Z n has an 
e-regular partition into k (nonexceptional) intervals with m < k < M if 
n>N. 

Note that this statement is very similar to the one gotten by taking 
applying the "standard" Regularity Lemma for graphs to the bipartite graph 
whose color classes are two copies of Z n , and so that there is an edge from 
s to t if a(s) < t. The difference lies primarily in that the blocks of the 
partition must be intervals, and the two partitions of the color classes are 
actually the same. 

We now prove a reformulation of this result which will be easier to use for 
some applications. Let C(S, a), for a set S C Z n and a G [0, 1], denote the 
fraction of elements of S whose image is less than an, i.e., |cr(5)n[0, an)!/^). 
We say that two functions /, g : [0, 1] — > [0, 1] are e-near if, for each a G [0, 1], 
g(a—e)—e < f(a) < g(a+e)+e. (We employ the convention that g (a) = g(0) 
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for a < and g(a) = g(l) for a > 1.) It is easy to see that this definition is 
symmetric in / and g. 

Now, we say that a partition {Cj}j =0 of a is (e, jF)-uniform, where T = 
{/«}*=!, if it is equitable, \Co\ < en, and, for each s G {1, . . . , k} and every 
interval I C C s with |/| > e|C s |, £(/, •) is e-near / s . 

The following theorem, which we consider to be the main one of this 
paper, says essentially that permutations are, up to small deviations, con- 
catenations of "deterministic" maps (ones which send all points into just a 
few small intervals) and "random" maps (ones which resemble the original 
map on each subinterval) . Note the absence of any "exceptionality" other 
than the exceptional set itself, in contrast to the Graph Regularity Lemma, 
where exceptional pairs are unavoidable. 

Theorem 2 (Permutation Uniformity). Given m G N + and < e < 1, 

there exists M = M(e, m) and N = N(e, m) so that, if n > N, a : Z n — ► Z n 
has an (e, ^-uniform partition {Cj}j =0 , with m < k < M, where T is a 
collection of k nondecreasing C°° functions fj : [0, 1] — > [0, 1]. 

Proof. Without loss of generality, we may assume that e < 1/2. Apply 
Theorem ^ and take an e 2 /4-regular partition of a so that each Cj, j > 1, 
has cardinality < en/ '4. (We may always do so by choosing a partition 
with even higher regularity if necessary.) Note that there can be at most 
ek/2 indices s G [k] so that there are more than ek/2 indices t with (s,t) 
being e-irregular. Call all other s "good", and add each "bad" Cj to Co to 
create a new partition of a. Then the new exceptional set has size at most 
e 2 n/A + (ek/2)(n/k) < en. 

Fix a good s, and let A be any subset of Z n . Now, suppose x, y G Z n 
have the property that there is some Ct C [x, y). Then 

laMUM) e A x C t : a{s) < t}\ > \a(A) n [0,x)|, 

so d(A,Ct) > C(A,x/n). Similarly, d(A,C t ) < C(A,y/n). In order to guar- 
antee that there is such a Ct and that (s, t) is regular, it suffices to ensure 
that the gap between x and y is at least 

(ek/2 + l)|Ci| + e 2 n/ 4 < W 2 + W 4 + e ^/ 4 < en > 

since it should be the length of ek/2 + 1 Cj's plus all the points of Co- 
Therefore, if we set y = (a + e)n, x = an, we have 

C(X, a) < d(X, C t ) < C(X, a + e). 
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On the other hand, we may take z = (a — e)n, and there will be a C t > C [z, x), 
so that 

C(X, a-e)< d(X, C t >) < C(X, a) < d(X, C t ) < C(X, a + e). (1) 

If we take X = I, an interval of C s of length at least e|C s |, then we may 
apply (|TJ) to get 

d(I,C t r)<£(I,a)<d(I,C t ). 
Then, using the regularity of the partition, we see that 

d(C s , CV) - e < C(I, a) < d(C„ C t ) + e. 

Applying (0) once more, this time with X = C s , 

£{C S , a-e)-e< £(J, a) < C(C S , a + e) + e. 

Since this analysis works for any a G [e, 1 — e), and the conclusion holds 
trivially otherwise, we may take f s (a) = C(C s ,a). 

Note that jC(C s , a + 1/n) — C(C S , a) < \Cs\~ 1 < IkrT 1 . It is easy to see, 
then, that by choosing n large enough we may assume that all of the f s are 
C°° and monotone. □ 

3 Pattern Avoidance 

Define A t (<t) for r G & m and a G & n to be the number of occurrences of 
the pattern r in a, i.e., the number of "index sets" {x < . . . < x m _i} C 7L n 
such that o{xi) < <r(xj) iff r{i) < r(j). 

Suppose that a G & n has a uniform partition P, and r G & m . If it 
is known that A T (cr) = o(n m ), what can be said about the / s ? In fact, 
something quite strong: that it concentrates almost all the mass of a(C s ) in 
at most m — 1 very small intervals. 

Theorem 3. Suppose a G & n , r G & m , < e < (2m)" 1 , and n is 
sufficiently large. Choose {Cj}j =0 , an (e, JF) -uniform partition of a. If 
A T (cr) < (en/2km) m , then, for each 1 < s < k, there is a collection X of 
at most m — 1 disjoint intervals in [0, 1), each of length at most 6e, so that 
\a(C s )D(n-\jX)\ > \C s \(l-7me). 
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Proof. Write T = {f s } k s=v First we prove a claim: if J , . . . , J m -i are disjoint 
intervals of [0, 1) which are separated from each other by at least 4e, then, 
for some t, we have 

/ s (sup J t ) - fsiyal J t ) < 5e. 

To see this, suppose the contrary, i.e., that there are m such intervals for 
which ,/ s (sup J t )—f s (iai J t ) > 5e. Then split C s into m intervals C°, . . . , C™ _1 
whose sizes differ by at most 1, and denote their density functions by /!(•) — 
£(Cf , ■). Writing x f = inf J f and ?/j = sup Jt, we have 

//fo + 2e) - /*(x t - 2e) > (£(C S) ^ + e) - e) - (£(C a , x t - e) + e) 

>(f s (yt)-e)-{f s (x t ) + e)-2e 
= fs(yt) ~ fs(x t ) - Ae > e, 

since |C||/|C 8 | > e. Define x' t = max{0,x t — 2e} and y' t = min{l, y t +2e}, and 
note that the intervals { J' t = [x' t , y' t )} are disjoint, by the separation property 
of the Jt- Then the fact that fs(y' t ) — f%(x' t ) > e for each q and t implies 
that \a{Cf) n J' r{q) \ > e|C|| > en/2km. If we take any ^ e C| n 
then ^o, . . . , Zm-i is a r-pattern in a, so we have at least (en/2km) m such 
patterns, a contradiction. 

Now, consider the following process: begin at 0, and find the first r so 
that f s (r) = 5e (or r = 1 if such a point does not exist). This is possible 
because f s is monotone and continuous and / s (l) = 1. Define J = [0, r). 
Then, let Jq = [r, r + 4e). Now, begin at r + 4e, find the first r' so that 
fs{r') — fsij" + 4e) = 5e (or r' = 1, again, if this is not possible), and define 
I\ — [t + 4e, r') and /{ = [r',r' + 4e). Then define r", I2, and similarly, 
and so on. This process must terminate in no more than [l/(5e)] steps, at 
which point the right-endpoint of the last interval defined is 1. In fact, it 
must terminate even sooner, by the claim above: if we have reached I' m , then 
Jo, ... , I m -i provide a contradiction. Then the l{ number at most m — 1 
and each has length at most 4e. Now, define I" to be the interval with left 
endpoint x'( = minjinf J/ + e, 1} and right endpoint y" = maxjsup I\ — e, 0}. 
Then 

C(C s ,y'{) - C{C s ,x'l) < (Usuph) + e) - (/.(inf/,) - e) < 7e. 

Therefore, the intervals which comprise the complement of 1J Z I", each of 
which contains some //, satisfy the conclusions of the theorem. □ 
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Define a permutation o G & n to be m-universal if A r (cx) > for each 
r G (5 m . Now, we say that a permutation ex has the (5, e, 7)-property if, 
for every interval / with |J| > 5n and every interval J with |J| < en, we 
have | cr(Z) n J| < 7|/|. That is, no sufficiently large interval is mapped too 
densely into any small interval. Our next result says that, for the appropriate 
parameters, this property implies universality. Note that, if we had instead 
stated that D J\ > when \J\ > en, this would be immediate. With 
the reverse inequalities, however, it is far from obvious. On the other hand, 
if <$7 > e, the statement would be vacuous. Therefore, in particular, it has 
content whenever 5 < e. 

Proposition 4. For each m > 2 and e > 0, there is a positive 5 < e so 
that, forn sufficiently large, if a G & n has the (5, e,m~ l ) -property, then a is 
m-universal. 

Proof. Suppose the contrary, so that there is some r G & m with A r (a) = 0. 
Take e' = min{e/6, m~ 2 /8}, and choose an (e', J 7 ) -uniform partition {Cj}j =0 . 
Let C s be any block of the partition. Then, by Theorem |21 at least |C S |(1 — 
7me')/(m — 1) points of / = C s are mapped by a into some interval J of 
length at most en. However, if we take 5 = 1/k and 7 = l/(m — 1), then the 
fact that a has the (5, e, 7)-property provides a contradiction, since |/| > 5n, 
J < en, and 

\a(I) n J\ l-7me 1 

1 Y . — - > > — . 

|/| m — 1 m 

□ 

Now, we show that any permutation which avoids a given r has a lin- 
ear sized subpattern which is "nearly monotone". (Compare to the Erdos- 
Szekeres Theorem, which says that any permutation on n symbols has a 
y^-sized truly monotone subpattern.) Define a permutation p G & r to be 
S -pseudomonotone if either A^ 01 ^(p) < 5(Q or A < - 10 ^(p) < #(2). Then we have 
the following. 

Proposition 5. For every 5 > and r G & m , there is a c > so that, for 
any permutation a G & n which avoids r , with n sufficiently large, there is a 
set I C Z n with |/| > cn so that a\i is 6 -pseudomonotone. 

Proof. We may assume m > 2, and fix e = t]/14m with r\ < 1. By Theorem 
a has an (e, jF)-uniform partition so that, for each 1 < s < k, there is an 
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interval I s of length at most 6e so that \a(C s )r\(n-I s )\ > |C S |(1— 7me)/(m— 1). 
Order the C s left-to-right. Suppose that T of the I s intersect some fixed I t . 
At least 

^ 1 — 7me ^ T(l — 7me)(l — e)n 
m — 1 s — fc(m — 1) 
points are mapped by <r into an interval of length at most 18en. Therefore, 

18efc(m-l) 
£ (l-7me)(l- < ^ 

Hence, we may iteratively pick si, . . . , s^-i/ 6 ] so that the 7 S are mutually 
disjoint. By the Erdos-Szekeres Theorem, there is a subset s' x < ■ ■ ■ < s' R 
of these s^- of size at least [r/~ 1 /6] 1 ^ 2 which is monotone with respect to 
the obvious ordering on the I s .. Let X be the union of the cr _1 (/ s ' ) fl C s i.. 

i-m-i ^ 3 3 

The only pairs of elements of X which possibly display the opposite ordering 
to that of the intervals I s '. are contained within a single set of the form 
fl C s ',. The fraction of pairs in X of this type is at most 

R( ]C 9 sl ) 4(m-l) 2 1/2/ x2 
^ < V - ; < 10r/ 1/2 (m - l) 2 . 



^R\C s \/2(m-l)^ - ft 

If we let r] = 5 2 {m — 1)~ 4 /100, then the set X is 5-pseudomonotone and has 
cardinality at least n ■ (20(m — 1)/Sk). □ 



4 Destroying Patterns 

With the graph regularity lemma, one can prove that, if a graph G contains at 
most o(n m ) copies of some m-vertex graph, then we may remove o(n 2 ) edges 
to destroy all copies. Is there any hope of proving something analogous for 
permutations? 

The first observation to make is that this is certainly not possible if one 
wishes to delete elements of the ground set. Consider the permutation 

a = (1, 0,3,2, 5,4,. . .,n- l,n-2) 

for n even. It is clear that, even though this permutation has A 10 (a) = 
0(n) = o(n 2 ), one must remove Q(n) points to destroy all copies. Further- 
more, the generalization of this construction to other patterns is a simple 
matter. 
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Clearly, deleting points of the ground set is not the proper analogue of 
removing edges. Let us instead attempt to "delete" pairs of points. We wish 
to choose a subset S C ( z 2 ™) so that every copy of r in a contains (in its 
index set) both points of some element of S. To state it another way: if we 
do not count index sets in which pairs from S appear, there are no copies 
of the pattern r. Any copy of r containing such a pair we say is destroyed 
by the deletion of S. The main result of this section says that, using o(n 2 ) 
such deletions, we may destroy all copies of r in a permutation a which has 
A t (<t) = o{n m ). 

Proposition 6. Suppose that a G & n , r G & m , and A T (cr) = o(n m ). Then 
we may delete at most o{n 2 ) index pairs to destroy all copies of r. 

Proof. Take an (e, JF) -uniform partition {Cj}j =0 , e < (2m)" 1 , and choose n 
large enough that A r (cr) < (en/2km) m . By Theorem El for each 1 < s < k, 
there is a collection X s of at most m — 1 disjoint intervals {J|} in [0, 1), each 
of length at most 6e, so that \o~(C s ) PI (n ■ [jl s )\ > \C S \(1 — 7me). We create 
a new collection of families X' s of intervals as follows. Begin with the X s . If 
an interval n -X s receives fewer than e|C s | points of C s under the action of a, 
we remove it from the collection. Then each X' s has at most m — 1 elements 
and \a(C s ) n (n • {JX' S )\ > \C S \(1 - 8me). 

Now, delete all pairs which contain at least one point of Cq or a point 
whose image does not fall into any of the n ■ X' s . There are most (8m + l)en 2 
of these. Then, delete all pairs which contain two points from any one of the 
sets C s . This uses at most k( ]c 2 sl ) < n 2 /2k pairs. Finally, delete all pairs 
whose elements are mapped to points within 12en of each other by a. There 
are at most 12en 2 of these. Hence, letting e — * and k — > oo, the result 
follows if we can show that the chosen deletion indeed destroys all copies of 

T. 

Suppose not. Then the index set on which r appears, %q < . . . < i m -i 
must have the following properties: 

1. For each r = 0, . . . , m — 1, o~(i r ) £ n ■ P s for some s and j. 

2. For each q — 0, . . . , m — 2, a(i T -i^) < a(i T -it q+ r)) — 12en. 

3. If a(i r ) E n ■ P s and cr(v) G n • P s , , then s ^ s'. 

Since each of the P s have diameter at most 6e, the first two properties imply 
that the n ■ P s must be disjoint. Order these (dilated) intervals by increasing 
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s, i.e., s < • • • < s m _i, and call them J , . . . , J m _i. Because they are disjoint 
and a(i r ) G J r for each r, the intervals themselves are ordered like a copy of 
r. Therefore, since the indices s are distinct, for any set of m indices drawn 
one from each of a" 1 (Jo) H C So) ■ " • > <7 _1 (^m-i) H C Sro _ 1 , a restricted to this 
set is a copy of r. This ensures that 

A» > (e\C s \) m > (en/2k) m , 

a contradiction. □ 



5 Quasirandomness 

In jHj, the author proves that a number of random-like properties of permu- 
tations are equivalent to one another. In order to state the main result of 
that paper, a few definitions are necessary. Fix a permutation a G & n . For 
any S, T C 7L n we define the discrepancy of S in T as 

IMS) 

n 

and we define the discrepancy of a permutation a by 

D(o-) = maxDj(«7(J)), 
j, j 

where / and J vary over all intervals of Z n . Also, define 

D*(tr) = max Dj(a(I)), 

where 7 and J vary only over "initial" intervals, i.e., intervals of the form 
[0,x). 

We say that a sequence {<Tj}~ 1 of permutations of Z ni , Z n2 , ... is quasi- 
random if -D(cjj) = o(rij). Often the indices are suppressed, and we simply 
say that D(a) = o(n). 

By e(x), we mean e 2wtx . We also use the convention that the name of a 
set and its characteristic function are the same. The following is a portion 
of the main theorem in [3]. 

Theorem 7. For any sequence of permutations a G & n , integer m > 2, and 
real a > 0, the following are equivalent: 
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[UB] (Uniform Balance) D(a) = o(n). 

[UB*] (Uniform Star-Balance) D*(a) = o(n). 

[SP] (Separability) For any intervals /, J, K, K' C Z n; 



Y, I(x)J(a(x))-- J ^) J ^) 



x€K,y£K' 



o{n) 



[mS] (m- Sub sequences) For any permutation r G & m and intervals I, J C Z n 
wtt |/| > n/2 and \ J\ > n/2, we have \I fl a~ 1 (J)| > n/4 + o(n) and 



/no-^j), 



1 /|a(/)nJ| 



m 



[2S] (2- Sub sequences) For any intervals I, J C Z n |/| > n/2 and | J| > 
n/2 ; we have \I fl <7 -1 (J)| > n/4 + o(n) and 

A (01) (^l/n CT - 1( j)) " A (10) H/n CT - V )) = o(n 2 ). 

[E(a)] (Eigenvalue Bound a) For all nonzero fceZ„ and any interval I, 

e{-ks/n) =o{n\k\ a ). 

sea(I) 



[T] (Translation) For any intervals I, J, 



W k( /)n(J + *)|-HMY 



o(n 3 



Furthermore, for any implication between a pair of properties above, there 
exists a constant K so that the error term e 2 n k of the consequent is bounded 
by the error term e^n 1 of the antecedent in the sense that e 2 = 0(ef). 

In [TT], the authors connect graph quasirandomness and regularity by 
showing that, essentially, a sequence of graphs is quasirandom if and only 
if they possess density 1/2 regular partitions with arbitrarily small e. Here, 
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we prove an analogous result for permutations. Let 0\{x) denote some real 
number whose absolute value is at most x. 



Proposition 8. A sequence of permutations o~i G & ni , i > 1? n % — > oo, 

is quasirandom if and only if, for each e > 0, given any (e, {f s })- uniform 
partition of with i sufficiently large, f s (x) is 2e-near id(x) = x for each s. 

Proof. Suppose that a is quasirandom, and let P be an (e, jF)-uniform par- 
tition. For any interval C s , [UB] implies that 

C{C s ,a) =a + o(l). 

Therefore, if we choose large enough, we may ensure that \C{C S) a) — a\ < e 
for all a e [0, 1], which immediately implies that f s is 2e-near id for each s. 

On the other hand, suppose Oi has an (e, {/ s })-uniform partition P = 
{Cj}j =0 for all sufficiently large i, where f s is 2e-near id for each s. It is easy 
to see that this implies (by sub-additivity) that P is (3e, {id})-umiorm. We 
may assume that the Cj are ordered left-to-right. Choose [0,x], [0,y) C Z ni . 
If, for some s, Cj C [0, x) for all 1 < j < s, choose the largest such s, and let 
X = [0, x) n C s+l (or if s = k). Otherwise, let X = [0, x) and s = 0. Then 



[0, x) = I [J cAuXUE 

\l<j<s J 



for some E C Cq. Therefore, s\C\\ + |X| = x + Oi(enj). We may write 



a([0,x))n[0,y) 



U ^(Ci)n[0,y) 

l<j<s 



+ \a(X)n[0,y)\ + \a(E)n[0,y)\. 



Clearly, \cr(E) H [0,y)\ = Oi(eni). Define [3 = yjn^ Applying uniformity, 
then, 



a 



([0,x))n[0,y) 



\C 1 \£(C j ,p) + \X\£(X,p) + 1 (en i ) 



l<j<s 



\Cx\(P + Oi(3e)) + \X\(/3 + Oi(3e)) + Oil 

i<j< s 

(s|d| + |X|)/3 + 1 (4en i ), 
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so we may conclude that 



\o 



([0,x))n[0,y)| 



xy 
fit 



< herij. 



If we take e — > 0, then crj is quasirandom, by [UB*]. 



□ 



6 Counting Subpatterns 

We wish to count how many occurrences of the pattern r G & m appear 
in a permutation a of Z n with a given (e, jF)-uniform partition {Cj}^ =0 . 
Unfortunately, we have no control over the structure of the exceptional set 
Co, so it is not possible to get an "exact" count this way. Nonetheless, if we 
write X = [jj Cj, it is easy to see that, for n > e _1 > 2, 



X 
m 



< 2en m /(m 



(2) 



so that a count of the r-patterns on X is going to be close to the same count 
on all of Z n . Note that we may also ignore all but the set of occurrences of 
r all of whose symbols occur in different C/s, since the number of these is 



l , which is off from f n ) by at most 



711 



\C | m 



< 2en m /{m 



(3) 



for n sufficiently large. 

There is an additional obstruction to counting patterns that is more subtle 
than these two issues. Suppose the mass of cr(C s ), for some s, were very 
tightly concentrated in some interval. If we use f s as an estimate of its 
density function, then, since the condition of e-nearness can "dislocate" the 
entire mass of a{C s ) by up to e, the counts could be off by a significant 
amount. On the other hand, only index sets whose images have two points 
close to one another can be affected in this way. Since there are few of these, 
with some work, we are able to ignore them in the total count. 

One way ensure that the counts are accurate is simply to posit that f s 
does not concentrate its mass too tightly. Therefore, define f s to be (B, e)- 
Lipschitz if, for each x, \f s (x + e) — f s (x)\ < Be. For example, if we have a 
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(quasi-) random permutation, we may take f s (x) = x for each s, a function 
which is (1, e)-Lipschitz for each e. 

The following lemma makes this idea rigorous. Define 

T) r {(3) = {x = (x u . . . , x r ) G [0, (3) r : Vi, 1 < % < r, x { < x i+1 }. 

Lemma 9. Let fj,gj, j — 1, . . . ,r, be cumulative distribution functions on 
[0,a], a > 0, and suppose that for each j, fj is e-near g,y If gj is (B,e)- 
Lipschitz for each j , then 

< r{a + 1)(B + l)e. 



a(xi) dF r — / a(xi) dG r 

3T(/3) JV r ((3) 

for any < (3 < a, where dF r = dfi{x\) ■ ■ ■ df r (x r ), dG r = dgi(x±) ■ ■ ■ dg r (x r ), 
and a : [0, a] — > [0, 1] is any nondecreasing function. 

Proof. We prove the result by induction, by repeated application of integra- 
tion by parts. First, we check that it is true for r = 1. Note that, since fj is 
e-near gj, 

fj(P) = 9i(P) + 1 ({B + l)e) 
by the Lipschitz property, and we have 

a(xi)dfi(xi) = / a(xi)d/i(xi) 



s 1 ^) Jo 



<*(P)fi(P) ~ I h{xx)da(x x ) 



o 



a(f3)g 1 (f3) + 1 ((B + l)e) 

\giM + O^B + l)e)]da{zi) 



/ a(xi)d^i(xi) + 0i((a + l)(5 + l)e). 
Jo 



Now, suppose the result holds for r — 1, with r > 1. Then 
/ dF r = / / oi(xi) dF r _i df r (x r ) 

JT> r (f3) JO J^- 1 (x r ) 

( [ a(x 1 )dG r _ 1 + d((r - l)(a + 1)(£ + l)e) 
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& r 

/ a{x x ) dGr-x df r (x r ) + d((r - l)(a + 1)(5 + l)e) 

The function oii(x r ) = J^y—iu \ a ( x i) dG r _\ is nondecreasing, nonnegative, 
and bounded by 1, so we may apply the r = 1 case to get 



□ 



dF r = / dG r + 1 (r(a+ l)(B + l)e) 



We wish to be able to count subpatterns in permutations which do not 
necessarily have the Lipschitz property, however. In order to be able to use 
this result, we have the following Lemma which says that convolving the c.d.f. 
of a permutation with a uniform distribution on a short interval preserves 
nearness and gives us a Lipschitz property. Therefore, fix 5 > and, given a 
c.d.f. / on [0, 1], define fit) = 5" 1 f° s f(t + s)ds, a c.d.f. on [0, 1 + 5}. 

Lemma 10. If f and g, c.d.f. 's on [0, 1], are e-near, then f and g are e-near. 
Furthermore, f is (2e5~ 1 ,e)-Lipschitz. 

Proof. To see the first claim, we write 

g(t + e) - fit) = 5- 1 / g{s + e) - f{s) ds < e. 

Jt-s 

For the second claim, 

f(t + e) - /(*) = 5~ 1 ( f(s)ds- [ f(s)ds 

\Jt+e-S Jt-8 

(rt+e rt+e-S 
J t f{s)ds- J ^ f(s)ds 



< 2e5-\ 



□ 



Now, fix a permutation r e & m . Write 25 for £> m (l), £>' for D m (l + 5), 
and define a differential form dujf on [0, l) m as follows: 



m— 1 



l<s <...<s m _i<fc j=0 

We define da; 9 , etc., analogously. Then we have the following. 
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Theorem 11. Suppose duf is defined as above, e < 1/2, andn is sufficiently 
large. Then 



A r fa] 



doot 



< (20e 1/2 m 2 + A/k)n m /{m - 1)! 



Proof. Define gj(x) = £{Cj, x). Suppose that m elements Xj G Z n are chosen 
uniformly at random, X{ G Cj., for some sequence j < . . . < j m -\. Writing 
j' r = j r -i( r ), the probability that their images under a form a r is precisely 



3 



Xjji ) 



because dgj represents the distribution of the images o~(xi) for Xi chosen at 
random from Cj, and we wish to compute the probability that 



a{x fo ) < ■■■ <o-(x fm _ iJ 



Multiplying by the number \Ci\ m of m-tuples, adding over all subsets {jj}, 
and accounting for J2J) and (jHJ) yields 



A T (a) - / du g \ < Aen m /(m-l)\ 



(4) 



By Lemma l^and Lemma ITU1 with 5 = e 1 / 2 , the quantity 



d 9f (si) • • • dg frn l (x m ) - / df fo {x x ) ■ ■ ■ df fm i (x m ) 



Si' 



is bounded by m(2 + e 1 / 2 )(2e 1 ^ 2 + l)e < 9me. Summing up again, we find 



Now, let dG = k~ l ^ k - =l dgj and dG = k^ 1 ^2^ =l dcjj. If we choose an m- 
tuple of points from each of these distributions, the distributions of their 
orderings with respect to increasing j coincide so long as each point is at 
least 5 away from all the others, since we may view dG as a random draw 
from the distribution dG followed by a random "jump" forward uniformly 
distributed in [0, 5]. The probability that such an m-tuple has two points at 



dojj 



du? 



< 9en m /(m- 1)! 



(5) 



l v^fc 
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most 5 apart is bounded by the probability that some pair of its points are 



that close, i.e., 



m \ l x+s 
< I ) max / dG 



-6 

i ' r-x+5 



2 

771 

2, 

m ) max i V (C(Cj, x + 8)- C(C jt x - 5)) 

2 1 x k ^— ' 

/ j= i 

m\ | U, oiPj) n [(x - <5)n, (x + S)n) \ 
max 



max 

it 



2/ ^ fc|Ci| 

m 2 25n r.. . , 9 _ 
< — ■ jj-^ < m 2 5(l - e)- 1 < 2m 2 5. 

Since cko g (or dojg) is the same as the distribution of an unordered m-tuple 
drawn from dG (resp., dG) minus the event that two points are drawn from 
the same Cj, 



dojg — / dujg 



< (2e 1/2 m 2 + 2/k)n m /m\, (6) 



'33 JB' 

where the second summand follows from the fact that 

* V ^ < J _ (1 _ Olzly. < l/k . 
mj k m ~ K k 

Similarly, if we define dF = k~ x Y^j=i dfj, dF = k~ l J2j=i dfj, F(x) = J* dF, 
and G(x) = J* dG, the difference of the two integrals in question is bounded 
by (™) ( n ) times the probability that two points chosen from dF are within 
5, or 



< I m ) max / dF 



x+S 
x—S 

max(F(x + 5) - F(x - 5)) 



X 



( ™ ) (max(G(x + 5 + e) - G(x - 5 - e)) + 2e) 
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<m 2 (5 + 2e)(l - e)" 1 < 6e 1/2 



m 



And so, 



duj — I duj 
s Jt>' 



< (6e 1/2 m 2 + 2/k)n m /m\ 
Putting together (@J), (JSJ), (0), and (J7J), we have 



(7) 



duj, 



< (20e 1/2 m 2 + A/k)n m /m\ 



□ 



7 The Proof of Theorem U 

For the proof of Theorem ^ below, we are heavily indebted to [Sj, which we 
find to have the most comprehensible - if not the shortest - proof of the 
Regularity Lemma in the literature. 

For disjoint sets X,Y G Z n , define the "index" 

q(X,Y) = \X\\Y\d 2 (X,Y)/n 2 . 

Then extend this definition to a pair of partitions X of X and y of Y by 
q(X, y) = J2x>ex,Y'&y ?(^'' ^')- For a partition "P = {Cj}j =1 of Z n , we write 
= ^2ij<l(,Ci,Cj). If one set in the partition, C , has been designated 
as an exceptional set, then we treat Co as a collection of singletons in this 
sum. That is, we write V for the partition which refines V by splitting Co 
into singletons, then q(V) = q(V,V). First of all, note that 

q(v) = J2 MEd^Q, Cj) < n" 2 \Ci\ |Q| = 1. 



'■J 



Now, we have the following simple lemma, which says that refinement 
can only increase the index of a partition. 

Lemma 12. 

1. Let C,D C Z n fnot necessarily disjoint). If C is a partition of C and 
T> is a partition of D, then q(C,T>) > q(C,D). 



2. IfV,V are partitions of1 n and V refines V, then q(V) > q{V). 
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Proof. 



1. Let C = {Ci, . . . , C k } and V = {D u . . . , A}- Then 

a(C V) = ±Y P{Ci ' Di)2 
<& L > U ) n 2L \d\\DA 

~n* EijiaiPil 

(Eiiai)(E,-Pil) 

= q(C,D), 

where the inequality follows from Cauchy-Schwarz. 

2. Let T- 3 = {Ci, . . • , Cfc}, and for 1 < % < k, let Ci be the partition of 
induced by P'. Then 

where the inequality follows from part (1). 

□ 

The next lemma says that we may exploit irregular pairs to increase the 
index somewhat. 

Lemma 13. Let e > 0, and let C,D C Z„ be intervals. If (C, D) is not 
e-regular with respect to a, then there are partitions C and T> of C and D, 
respectively, so that 

q (C,V)>q(C,D) + e* l -^M. 

Proof. Suppose (C, D) is not e-regular, and choose intervals C\ C C and 
D\ C D with \C\\ > e\C\ and \D\\ > e\D\ so that \rj\ > e, where t] = 
d(d, Di) - d(C, D). Let C = {d, C 2 , C 3 } and V = {D u D 2 , D 3 }, where C 2 
is the "left half" of C\Ci, C 3 is the "right half"; and D 2 and D 3 are defined 
similarly. (That is, C\ splits the interval C into three pieces: C\ itself, one 
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interval of points less than those of C\ and one interval of points greater than 
those of C\. Either, but not both, of these may be empty.) 

For ease of notation, write c, = |Cj|, dj = \Di\, Cij = p(Ci,Dj), c = |C|, 
d = \D\, and e = p(C,D). Then, applying Cauchy-Schwarz again, we see 



2 1 / „2 e 2 



q(C V) = — V = — + V 

>]_(4 L+ (e-e li y 



n 2 \c\d\ cd — C\d\ 
Since en = c\d\cjcd + r\c\d\, we have 



2 / 

2 , , ± / cidie \ 1 / cd — c\d\ 



n q(C, V) > — - — — + r^cida + — — e - ncid] 

Cirfi \ cot J cd — C\di \ cd 

e 2 rfcidicd 

cd cd — c\d\ 

cd 



e 2 



since c\ > ec, di > ed, and n 2 > e 2 . □ 

The following lemma is the crux of the proof of Theorem ^ 

Lemma 14. Lei < e < 1/4 and G N, let a be a permutation of Z n , 
and /ei P 6e an equitable partition of Z„ into {Cj}*_ itrata |C 1 < en and 
|(7j| > 81 fc for j > 0. //P is not e-regular, then there is an equitable partition 
V = {Cj} l j =0 of Z n with exceptional set C , where k < I < k81 h , such that 
\Cq\ < | C 1 +n/9 fc and 

g(P') > g(P) + e 5 /2- 

Proof. Let c = |Ci|. For all 1 < i,j < k, define a partition C}j of C; and 
a partition C|j of Cj as follows. If the pair (C Cj) is e-regular, then let 
C\a = {Ci} and C|j = {Cj}. If not, then by the previous lemma, there are 
tripartitions C\,- and C 2 { of Cj and Cj, respectively, so that 



q(Cl p C] i )>q(C i ,C j ) + 



eV 
n 2 



21 



For each i = l,...,k, let Cj be the partition of C, that is the common 
refinement of every partition Q.-. Note that |Cj| < 9 k . Now, consider the 
partition 

k 

c = {c }u\Jc l , 

i=l 

with Cq as exceptional set. Then C refines V and k < \C\ < k9 h . 

Let C = {{x} : x G C }. If P is not e-regular, then for more than ek 2 of 
the pairs (Cj, Cj), the partitions and Cj, are nontrivial. Hence, by Lemma 
d 

g(C) = q(Ci, Cj) + g(C , d) 

i,j>l i>0 
i,j>l i>l 

4 2 

> £ g(a, C;) + efc 2 ^- + ^ q(Co, {C}) + q(C ) 

i,j>l i>l 

> g(P) + e 5 /2, 
since fee > 3n/4. 

Now, C satisfies the conclusions of the theorem, except that it may not 
be equitable. To fix the situation, cut each non-exceptional block of C into a 
maximal collection of disjoint intervals of size d = [c/81 fc J . Call the resulting 
set of intervals {Cj}^ =1 , and let C' = Z n \|JCj. This new partition V refines 
C, so 

q(V')>q(C)>q(V)+e 5 /2. 

Since each set Cj, j > 0, is contained in one of the sets Cj, but not more 
than 81 fc sets can lie inside the same Cj, we also have k < I < k81 k . On 
the other hand, the sets C[, . . . , C{ use all but at most d points from each 
nonexceptional block of C. Therefore, 

\C' \ < \C \+d\C\ 
= \C \ + ck9~ k 
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< I C 1 +nQ~ k . 

□ 

Now, since q(V) < 1, this lemma cannot be applied ad infinitum. Indeed, 
we may now complete the proof of Theorem 

Proof of Theorem^ Let e > and m > 1. Without loss of generality, 
e < 1/4. Let s = |~2/e 5 ]. If we apply Lemma IT41 s times, we end up with a 
partition of a which is e-regular. 

However, we need to choose the "starting" partition so that the excep- 
tional set ends up with cardinality < en and the Cj are sufficiently large at 
each stage. With each iteration of the lemma, the size of the exceptional set 
can grow by at most n/9 k . Therefore, we wish to choose k large enough so 
that s increments of n/9 k add up to at most en/2, and n large enough so that 
| C 1 < k implies |C | < en/ 2. (We can guarantee |C | < k if we begin with 
an equitable fc-partition.) So let k > m be large enough so that 9 k > 2s /e. 
Then s/9 k < e/2, and hence 

s 

k H — -n < en 
gk - 

whenever n > 2k /e. 

Now, define f(x) = x81 x . We may take M = max{/ s (fc), 2k/ e}. To 
deal with the second condition - that the blocks be sufficiently large at each 
stage - note that, after s steps, the nonexceptional blocks sizes are at least 
n/(2M) s . Therefore, choosing N = max{2M/e, 81 M (2M) S } suffices, and the 
proof is complete. □ 

8 Concluding Remarks 

The discussion of Section El is largely "local", i.e., the analysis is concerned 
with the internal structure of individual blocks of the uniform partition. 
Section |B] consists of a "global" analysis - it does not take into account the 
internal structure of the blocks, only their relationships with one another. 
On the other hand, the proofs of the main results of Sections 0] and El as well 
as that of Proposition El are both. It is here, in the interplay between local 
and global, that we believe the most interesting behavior resides. We sus- 
pect that such dual analysis may lead to a better understanding of extremal 
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permutations in the senses of Problem 2 and 3 of the Introduction, perhaps 
using the results of Section |U1 Theorem ^2 i n theory, gives a translation of 
these problems from combinatorial to analytic. We are hopeful that Theo- 
rem can find application in algorithmic settings, e.g., in the vein of [H], 
or in other contexts where a "counting lemma" has been useful, such as the 
hypergraph-theoretic proof of the Szemeredi Theorem. 
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